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ABSTRACT 


Analysts  conducting  experiments  must  frequently 
deal  with  situations  in  which  data  is  incomplete  or 
.missing.  This  creates  problems  that  can  seriously 
affect  classical  hypothesis  testing  by  introducing 
extraneous  terms  into  the  hypothesis  in  a complicated 
way.  A technique  exists  that  allows  an  anal-st  to 
determine  precisely  which  experimental  terms  are 
actually  present  in  a proposed  hypothesis  and  what 
that  hypothesis  would  actually  be  testing  if  employed. 
This  paper  examines  the  mathematics  underlying  the 
technique  and  applies  the  theory  to  a widely  used  data 
analysis  computer  package.  A computer  program  is 
presented  to  facilitate  implementation  of  the  method. 
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I 


EXECUTIVE  SUMMARY 


A.  PROBLEM 


At  times  even  the  most  carefully  designed  and  executed 
experiments  can  be  plagued  with  aborted  tests  or  missing 
data.  Such  unbalance  in  the  data  can  have  a significant 
impact  upon  the  mathematical  structure  of  analytic 
techniques  used  in  analysis  of  variance.  In  addition  to 
increasing  the  complexity  of  computations,  unbalanced  design 
can  also  seriously  affect  hypothesis  testing.  Because  of 
lack  of  balance,  hypotheses  purporting  to  test  the  influence 
of  a main  effect,  for  example,  may  be  hopelessly  confounded 
with  interaction  terms.  Blindly  •'testing"  such  confounded 
hypotheses  without  an  appreciation  of  the  level  of  pollution 
from  extraneous  terms  can  lead  to  serious  error  in 
interpreting  results.  It  is  desirable  to  find  a general 
procedure  for  use  with  analysis  of  variance  that  can 
determine  exactly  what  a proposed  hypothesis  is  testing  in 
terms  of  the  main  effects  and  interactions. 


B.  APPROACH 


Because  of  its  mathematical  power  and  notational 
simplicity,  the  matrix  form  of  the  linear  model  Y = Xb  ♦ e 
is  used  in  deriving  a solution  to  the  problem.  The  linear 
model  leads  to  the  "normal  equations"  X'Xb  = X'Y.  Since  X'X 
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is  in  general  not  of  full  rank,  any  solution  (b  ) for  b is 

-1 

not  unique.  Further  (X'X)  does  not  exist;  one  must  turn  to 
the  concept  of  a generalized  inverse  G of  X'X.  It  can  be 
shown  that  testing  • a hypothesis  : q'b  = m involves 

expressing  the  hypothesis  as  a linear  function  g'GX'X  of  the 


generalized  inverse  (G)  and  X'X.  While  determination  of 
g'GX'X  is  frequently  a non-trivial  manual  calculation,  it 
can  be  handled  easily  on  a computer. 


C.  SOLUTION 

If  an  analyst  needs  to  test  a particular  hypothesis  it 
is  possible  that  additional,  undesired  terms  may  be 
polluting  the  hypothesis  to  such  a degree  that  his 
interpretation  of  test  results  may  be  completely  invalid.  By 
computing  the  value  of  g'GX'X  he  will  be  able  to  determine 
precisely  what  his  proposed  hypothesis  is  actually  testing. 

D.  CONCLUSIONS 

Recognizing  that  an  unbalanced  design  can  lead  to 
difficulty  in  interpreting  traditional  tests  of  hypotheses, 
it  is  concluded  that: 

1 . it  is  mathematically  possible  to  determine  the  exact 
nature  of  a proposed  hypothesis,  and 

2.  such  a determination  is  feasible  using  a computer. 
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II.  MATHEMATICAL  justification 


A.  GENERALIZED  INVERSE 


A generalized  inverse  of  a matrix  A is  defined  to  be  any 
matrix  G satisfying 

AGA  = A. 

It  can  be  shown  that,  for  a given  matrix  A,  G is  in  general 
not  unique  [Searle,  1971], 

B.  SOLUTION  OF  CONSISTENT  LINEAR  EQUATIONS 

The  system  of  linear  equations  AX  = Y is  consistent  if 
any  linear  relationships  existing  among  the  rows  of  A also 
exist  among  the  corresponding  elements  of  Y.  Since  linear 
equations  have  a solution  if  and  only  if  they  are 
consistent,  the  procedures  outlined  below  are  confined  to 
systems  of  consistent  linear  equations. 

The  following  theorems  from  Searle  [1]  are  stated 
without  proof  in  order  to  develop  solution  procedures  for 
consistent  equations. 

Theorem  1.  Consistent  equations  AX  * Y have  a solution 
X - GY  if  and  only  if  AGA  * A. 

Theorem  2.  If  A has  q columns  and  if  G is  a generalized 
inverse  of  A,  then  the  consistent  equations  AX  * Y have  the 
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solution 


0 

X = GY  + (GA  - I)  Z 

where  Z is  any  arbitrary  vector  of  order  g.  The  notation 
0 

indicates  that  X , which  satisfies  AX  = Y,  is  a solution  and 
not  the  general  vector  of  unknowns  X. 

Theorem  3.  For  the  consistent  equations  AX  = Y,  all 

solutions  are,  for  any  specific  G,  generated  by 
0 

X = GY  ♦ (GA  - I)  Z,  for  arbitrary  Z.  That  is,  one  need 
derive  only  one  generalized  inverse  of  A in  order  to  be  able 
to  develop  all  solutions  to  the  system  AX  = Y. 


C.  THE  SPECIAL  CASE  OF  SYMMETRIC  MATRICES 


The  linear  model  used,  inter  alia,  in  analysis  of 
variance  involves  the  system  of  consistent  linear  equations 

X'Xb  = X'Y 

that  are  solved  for  b.  It  is  therefore  worthwhile  to 
consider  the  special  case  of  the  symmetric  matrix  X'X  in 
some  detail.  The  following  development  is  from  Searle  [1]. 

Lemma  1.  X'X  = 0 implies  X * 0. 

Proof:  This  is  true  because  if  X'X  = 0,  the  sums  of  squares 
of  the  elements  of  each  row  equal  zero,  hence  must  be  zero. 

Lemma  2.  PX ' X = QX'X  implies  that  PX'  = QX' . 

Proof:  ApFly  Lemma  1 to  the  identity 

(PX'X-QX'X)  (P-Q)  ' = (PX'-QX')  (PX'-QX')'  = 0. 

That  is,  (PX'-QX')  (PX'-QX')  • = 0 implies  that  (PX'-QX')  = 0 
which  implies  that  PX'  3 QX ' . 
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then 


Theorem  4.  When  G is  a generalized  inverse  of  X'X, 

i.  G'  is  also  a generalized  inverse  of  X'X; 

ii.  XGX'X  = X (i.e.,  GX'  is  a generalized  inverse  of  X)  ; 

iii.  XGX 1 is  invariant  to  G. 

Proof : 


(i)  By  definition 

X'XGX’X  = X'X 

transposing  X'XG'X'X  = X'X  establishing  (i) . 


(ii) 

by  Lemma  2 
transposing 


X'XG'X'X  = X'X 
X ' XG  ' X ' = X ' 

XGX'X  = X establishing  (ii) . 


(iii)  Suppose  F is  some  generalized  inverse  different 
from  G.  Then  XGX'X  = X = XFi'X.  3y  Lemma  2 XGX'  = XFX'. 
That  is,  XGX'  is  the  same  for  all  generalized  inverses  of 
X’X,  establishing  (iii). 


D.  THE  LINEAE  MODEL 


The  general  linear  model  is  Y = Xb  ♦ e where  X is  an 
n x 1 vector  of  observations  whose  components  are  random  and 
observable;  X is  a n n x p matrix  of  experimental  design 
whose  components  are  real  and  known;  b is  a p x 1 vector  of 
parameters  whose  components  are  real  and  unknown;  e is  an 
n x 1 vector  of  experimental  error  whose  components  are 
random  and  unobservable.  The  vector  e is  defined  as 
e * Y - E (Y) 

E(e)  = E(Y)  - E(2(Y))  = 0, 
and  E (Y)  * E (Xb)  ♦ E(e) 

E ( Y)  = Xb. 


2 

Every  element  in  e is  assumed  to  have  the  same  variance  v 
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and  zero  covariance  with  every  other  element,  thus  e is 

2 2 
distributed  (0,v  I)  and  Y is  distributed  (Xb,v  I).  Deriving 

the  normal  equations  for  the  linear  model  yields 

X • Xb  = X'Y 

J 0 

which  can  be  solved  for  b using  the  techniques  of 

generalized  inverses  described  earlier,  i.e., 

0 

b = GX'Y 
0 

and  E (b  ) = E (GX  • Y) 

= GX' 2 (Y) 

= GX 1 Xb 
= Hb 

where  H = GX'X. 
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III.  THE  CONCEPT  OF  ESTIB ABILITY 


A.  ESTIB ABILITY 


As  defined  by  Searle  [1],  a linear  function  g'b  of  the 
parameters  in  b is  estimable  if  it  is  equal  to  any  linear 
function  [t*E(Y)]  of  the  expected  value  of  the  observations 
in  Y.  It  is  important  to  note  that  t'  is  not  in  general 
unique;  the  only  requirement  for  estimability  is  that  such  a 
vector  exist. 


B.  PROPERTIES 


The  definition  of  estimability  leads  to  four 
mathematical  properties  of  immediate  importance; 

(1)  The  expected  value  of  any  observation  is  estimable. 
In  this  case  t'  is  a vector  with  a single  element  equal  to 
one;  the  rest  of  its  elements  are  zero. 

(2)  Any  linear  combination  of  estimable  functions  is 

estimable.  If  q'b  and  r'b  are  estimable,  then  q'b  = t'E(Y) 

and  r'b  * s'E(Y).  Therefore  c q'b  ♦ c r'b  = (c  t'  ♦ 

1 2 1 

c^s')E(Y)  which  is  estimable. 

(3)  An  alternative  form  of  the  condition  of  estimability 
can  be  developed  as  follows.  If  q'b  is  estimable,  then  by 
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definition  g'b  = t'E(Y)  hence  g'b  = t'Xb.  This  oust  hold  for 
all  values  of  b since  the  condition  of  estimability  does  not 
depend  on  a specific  choice  of  b.  This  leads  to  the  result 
g ' = t'X. 


0 0 

(4)  When  g'b  is  estimable,  g'b  is  invariant  to  the 
0 

solution  used  for  b because 

0 0 

g'b  = t'Xb  = t'XGX' Y. 

0 

Since  by  Theorem  4,  XGX'  is  invariant  to  G,  g'b  is 

0 

invariant  to  G and  therefore  to  b when  g'b  is  estimable. 


Herein  lies  the  essential  importance  of  estimability:  if 

0 

g'b  is  estimable,  g'b  has  the  same  value  for  all  solutions 
0 

b . That  is,  an  estimable  function  is  a linear  function  of 


the  parameters  that  is  invariant  to  whatever  solution  is 
0 

used  for  b . 


C.  THE  TEST 


A function  g'b  is  estimable  if  there  exists  some  vector 
t’  such  that  g'  = t'X.  Pinding  such  a vector  t'  may  be  a 
formidable  task  with  a design  of  large  dimensions.  As  an 
alternative,  it  is  possible  to  test  for  estimability  by 
determining  if  g'H  * g'.  Searle  [1]  shows  that  g'b  is 
estimable  if  and  only  if  g'H  = g’,  as  follows. 

If  g'b  is  estimable 

g*  * t'X 
g'H  * t'XH 
g'H  = t'XGX'X 

by  Theorem  4 GX'  is  a generalized  inverse  of  X, 
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hence 


q'H  = t'X 
q'H  = q'. 

On  the  other  hand,  if 

q'  ■ g'H, 

q 1 = q'GX'X 

and  q*  = t'X  for  t'  = q'GX' . 


D.  THE  CONSTBMNED  MODEL 


1 • development 

The  normal  equations  X'Xb  = X'Y  form  a consistent 
system  of  linear  equations  where  X is  of  rank  r<?.  Because 
X ' X is,  in  general,  not  of  full  rank,  there  are  many 
solution  vectors  that  will  satisfy  the  system.  In  order  to 

0 

obtain  a particular  solution  b , additional  constraints  of 

the  form  Cb  = 0 are  often  added  to  the  model.  & commonly 
used  set  of  constraints  satisfies  the  restrictions 

* the  main  effects  sum  to  zero 

* the  interaction  effects  sum  to  zero  across  each 
subscript. 

Adding  the  constraints  Cb  = 0,  where  the  (p-r)  rows 
of  C are  linearly  independent  of  the  rows  of  X,  yields  the 
following  system  of  linear  equations: 


Y 

X 

e 

• • • 
0 

s 

• • • 
c 

b ♦ 

• • • 
0 

The  constraint  matrix  C can  be  used  to  transform  the  design 

v 

matrix  X into  a constrained  matrix  X by  performing  basic 
row  operations  on  the  system  of  linear  equations. 
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unchanged.  The  practical  effect  of  introducing  the 
constraints  into  the  design  matrix  is  to  make  some  of  the 
* 

columns  of  X consist  entirely  of  zeros.  While  b remains 

* 

unchanged,  the  transformation  of  X into  X has  the  effect  of 
•’deleting”  some  elements  of  the  parameter  vector  by  the 
mechanism  of  creating  those  columns  of  zeros.  Once  the 
constraints  have  been  integrated  into  the  design  matrix, 

* 

transforming  X into  X , the  constraints  become  redundant  and 
can  be  removed  from  the  model  by  the  following  technique. 
Let  A be  a (n,n  + p-r)  matrix  such  that 
A = 0 if  i # j 

ij 

A » 1 if  i « j. 

ij 

Then  multiplying  by  A, 


* 


* 

yields  the  constrained  linear  model  Y = X b ♦ e,  which  is 
equivalent  to  the  constrained  system  above. 


Since  e is  assumed  normal  (0,  v I),  Y is  also 

» * * * 
normal;  E(I)  = X b.  The  normal  equations,  X ’ X b = X ' Y,  can 


be  solved  for  a particular  solution  b that  will  also 

* 

satisfy  the  original  normal  equations  X'Xb  = XT.  If  G is 

* * 0 * * 

defined  as  the  generalized  inverse  of  X ' X then  b - G X T 
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and 


* * * 

it  follows  that  G X ' is  a generalized  inverse  of  X . 

* * * * 

Let  H = G X 'X  . 

It  is  stressed  that  this  constrained  linear  model 
was  developed  solely  for  the  purpose  of  finding  some 

0 

particular  solution  vector  b to  the  original  system  of 

* 

normal  equations.  In  the  discussion  that  follows,  X is  the 

same  size  as  X and  the  parameter  vector  b is  the  same  in  the 
constrained  linear  model  as  it  was  in  the  original  linear 
model  Y = Xb  ♦ e. 

2 . example 


As  a simple  example  of  the  development  of  X , assume 
that  the  design  matrix  X is 
10  0 1 

10  10 
110  0 

Let  C be 

0 111. 

Then  J X I is 
I C | 

10  0 1 

10  10 
110  0 
0 111 

Subtracting  the  bottom  row  from  the  top  row  yields 
1-1-1  0 
10  10' 

110  0 
0 111 


17 


which  is 


The  appropriate  A matrix  is 
10  0 0 
0 10  0 

0 0 10 

* 

Multiplying  yields  X 

1-1-1  0 
10  10 
110  0 


E.  BIOMEDICAL  COMPUTER  PROGRAMS 


The  University  of  California  publishes  and  maintains  the 
BIOMED  series  of  standard  data  analysis  packages  for  use  on 
digital  computers  [Dixon,  1976].  One  of  the  programs  within 
the  package,  BMD05V,  performs  computations  for  analysis  of 
variance  with  the  linear  statistical  model.  The  design 
matrix  employed  is  not  the  same  as  the  design  matrix  (X)  in 
that  model  however.  A user  of  BMD05V  is  required  to 
introduce  appropriate  additional  constraints  to  permit 

0 

computing  a particular  solution  (b  ) for  the  parameter 

vector.  It  will  be  shown  that  techniques  applicable  to  the 
design  matrix  X in  the  general  linear  model  can  be  applied 
directly  to  the  BND05V  design  matrix. 
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P.  APPLICATION  TO  BBD05V 


The  constraints,  Cb  = 0,  added  to  the  linear  model  in 
section  D above  are  the  type  used  to  generate  the  BMD05V 

* 

design  matrix.  The  resulting  matrix  X has  the  same  number 

of  columns  as  the  original  design  matrix  X,  but  because  some 
of  these  columns  are  zero,  it  is  possible  to  suppress  them 
for  arithmetic  purposes.  For  computational  simplicity,  the 
matrix  actually  used  by  the  computer  program  deletes  all  the 
zero  columns  and  assumes  a corresponding  "reparameterized"  b 
vector  of  lover  dimension.  For  mathematical  rigor,  however, 
* 

the  X used  in  the  following  sections  retains  the  same 

number  of  columns  as  X.  This  restriction  will  be  eased  when 

* 

the  matrix  X is  actually  applied  to  the  computer  programs 

in  Appendix  B. 


G.  ESTIMABILITY  IN  THE  CONSTRAINED  MODEL 

It  can  be  shown  that  estimability  in  the  constrained 

* 

model  T ■ X b ♦ e follows  the  same  pattern  as  estimability 
in  the  full  model  I * Xb  ♦ e. 


* 

Theorem  5.  g'b  is  estimable  if  and  only  if  g'H  = g'. 
Proof:  By  definition,  g'b  is  estimable  if 
g'b  - t'E(I) , i.e.,  if 


19 


q'b  * t'X  b,  i.e.,  if 

* 

q'  * t’X  . Then 

* * * * * 

q'H  = t'X  G X 'X  , 

* * 

q'H  * t'X  = q' , 


and  if 


q'  * q'H  , 

* * * * * 

q'  = q'G  X 'X  . Let  t'  = q'G  X ', 


Then 


q'  = t'X  . 


This  result  allows  computations  to  be  performed 
directly  upon  the  constrained  matrix  in  order  to  examine  the 
estimability  of  proposed  hypotheses.  The  computer  program 
HYTEST  (Appendix  A)  can  accept  either  the  constrained  design 

* 

matrix  X (with  "zero"  columns  suppressed)  or  the  standard 

design  matrix  X as  an  input.  If  X is  used  for  input,  HYTEST 

* 

offers  the  option  of  using  either  the  constrained  matrix  X 
or  the  standard  matrix  X to  compute  tests  of  estimability. 


* * 
Mote  that  q'H  b is  always  estimable  since  q'H  b = 

* * * * * * * 
q'G  X 'X  b * q'G  X 'E(Y)  * t • E (Y)  where  t*  = q'G  X '. 


H.  TESTABILITY 


1 . the  hypothesis 


From  searle  [1],  all  linear  hypotheses  can  be 
handled  by  a general  procedure;  specific  hypotheses  are  then 
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considered  to  be  applications  of  this  general  procedure 


The  general  hypothesis 
where  K'  is  a matrix  of  s rows 


may  be 
and  p 


written  H : K'b  = m 
0 

columns.  The  only 


limitation  on  K*  is  that  it  have  full  row  rank.  That  is,  the 
hypothesis  must  be  composed  of  linearly  independent 
functions  of  the  parameter  vector. 


2.  analysis  of  variance 

To  review  analysis  of  variance  briefly,  classical 
techniques  rely  upon  the  ratio  of  two  independent  Chi-square 
distributions,  each  divided  by  its  respective  degrees  of 
freedom,  to  generate  an  F statistic.  The  sum  of  squares 
explained  by  the  model  if  the  hypothesis  is  assumed  true, 
divided  by  its  degrees  of  freedom  forms  the  numerator  of  the 
statistic.  For  many  situations,  the  denominator  is  the  sum 
of  squares  for  error  divided  by  its  degrees  of  freedom. 
Each  sum  of  squares  can  be  conveniently  represented  by 
appropriate  quadratic  forms  which  must  meet  certain 
requirements  in  order  to  be  Chi-square  distributed. 

Searle's  derivation  of  a test  of  the  general 

hypothesis  depends  upon  K'b  being  estimable  for  every  row 

k^'b.  If  this  assumption  is  satisfied,  the  quadratic  form 

20  -102 
Q/v  * (K'b  - m)'(K'GK)  (K'b  - m)/v 

is  distributed  non-central  Chi-square  and  has  rank  s.  The 

sum  of  squares  for  error  can  be  shown  to  be 

-1  -1 
SSE  » (Y-XK(K'K)  a)  • (I-XGX*)  (Y-XK  (X'K)  m)  . 

Q and  SSE  are  independent  so  F ( H)  * Q/s/SSE/  (n-r)  is 

distributed  non-central 
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, -1  2 

F'[  S,n-r,  (K'b-m)  • (K'GK)  (K»b-m)/2v  ] 

The  test  statistic  is 

0 -10 

F (H)  = (K'b  - in)'  (K'GK)  (K'b  - m) /s/SSE/ (n-r ) 
which  is  F distributed  with  s and  n-r  degrees  of  freedom 
under  the  null  hypothesis. 


3.  determination  of  testability 


Suppose  that 

K'b 

is 

not  estimable.  Then 

the 

hypothesis 

H : K'b  = m 
0 

is 

not 

testable.  Assuming 

that 

-1 

(K'GK) 

exists,  if 

one 

were 

to  compute  F (H)  , 

what 

hypothesis 

is  actually 

being 

tested?  Whjn  working  with 

the 

constrained 

linear 

model 

Y = 

* 

X b + e,  the  answer 

is 

* 

"K'H  b * * * * m."  The  derivation  which  follows  closely  parallels 

the  procedure  used  by  Searle  [1]  for  the  linear  model 
Y = Xb  ♦ e. 


* 

The  hypothesis  H;  K'H  b 

is  estimable  for  each  row  k '. 

i 

quadratic  fora  is 


* 

* a is  testable  since  K'H  b 
The  appropriate  numerator 


*0  * * * -1  *o 

Qi  * (K'H  b - a) '(K'H  G H 'K)  (K'H  b - a) 

* 0 * * * * * 

But  K'H  b = K'G  X 'X  G X 'Y  and  since 

* * * * * * 

X G X '»X  G *X  • , it  follows 

* 0 ***** 

K'H  b » K'G  X 'X  G 'X  'Y. 

* * * * * 

Let  G ^ * G X ' X G ' (which  is  a generalized  inverse  of 
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* * 

X 'X  ) . Then 


indistinguishable  from  those  that  would  be  performed  in 

* 

testing  the  testable  hypothesis  K'H  b = m. 
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IV. 


SYNOPSIS  OF  RESULTS 


A.  CONCLUSIONS 


1.  A given  linear  function  of  the  parameter  vector  (g'b) 
is  estimable  if  and  only  if  g'H  = g'. 

2.  Since  only  estimable  functions  are  testable,  if 
q'H  # q'  then  the  hypothesis  actually  being  tested  is  not 

H : q'b  = m but  rather  H : q ' Hb  = m. 

0 0 


3.  The  mathematics  developed  for  proving  both  of  the 
preceding  conclusions  can  also  be  applied  to  a constrained 
design  matrix,  such  as  that  used  in  the  BMD05V  program,  to 
allow  determination  of  estimability  directly.  That  is, 

• the  function  q'b  is  estimable  if  and  only  if 

* 

q'H  = q ' , and 

« if  q'b  is  not  estimable  then  the  hypothesis 


V ,,b 


m is  actually  testing  : 


* 

q'H  b = m. 


B.  SIGNIFICANCE 


The  results  presented  above  afford  a mathematical 
justification  of  the  need  for  a computer  program  to 
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r 


determine  the  testability  of  proposed  hypotheses.  A sample 
program,  HYTEST,  is  presented  in  Appendix  D.  Using  the 
HYTEST  program  permits  an  analyst  to  determine  not  only 
whether  each  of  his  hypotheses  is  testable,  but  also 
precisely  what  main  effects  and  interactions  confound  each 
hypothesis  that  is  not  testable.  Such  information  can  be 
used  to  make  more  informed  decisions,  a priori,  on  the 
design  of  experiments,  and  a posteriori  on  the  analysis  and 
interpretation  of  experimental  results.  Knowing  the  nature 
and  degree  of  confounding  may  not  necessarily  ease  decision 
making.  It  can  help  to  ensure  that  once  a hypothesis  is 
accepted  or  rejected  the  analyst  is  aware  of  the  degree  of 
"purity”  of  his  conclusions  concerning  the  effects  of 
various  factors  in  the  experiment.  If  used,  these  techniques 
can  prevent  an  analyst  from  complacently  assuming  that  he  is 
testing  one  hypothesis  when  he  is,  in  fact,  testing 
something  quite  different. 

11  ■ 


APPENDIX  A 


HYTEST 


HYTEST  is  a FORTRAN  IV  program  that  facilitates 
determining  if  a proposed  hypothesis  is  testable.  The  User's 
Guidt  contains  operating  instructions  and  a list  of  options. 


FLOWCHART 
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COMPUTE  GENERALIZED 
INVERSE  G 


SUBROUTINE 


In  the  interest  of  computational  accuracy  and  speed# 
HYTEST  uses  subroutine  LPSDOR  from  the  International 
Mathematical  and  Statistical  Library  (IMSL)  to  compute  a 
generalized  inverse  G.  This  is  a proprietary  subroutine. 
Under  the  licensing  agreement,  its  code  may  not  be 
distributed  to  or  used  by  a user  outside  the  Naval 
Postgraduate  School.  It  may  not  be  used  on  a non-NPS 
computer  system. 
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APPENDIX  B 
USER'S  GUIDE 


This  guide  contains  complete  operating  instructions  for 
using  the  computer  program  HYTEST. 


* 

the  constrained  design  matrix  X actually  used  in  the 

* 

computer  programs  BMD05V  and  HYTEST  is  the  same  X 

introduced  in  Chapter  III  except  that  all  zero  columns  have 
been  deleted  to  facilitate  computation.  This  has  the  effect 
of  reducing  the  parameter  vector  b;  an  analyst  using  these 
programs  is  cautioned  to  ensure  that  he  is  aware  of  which 
elements  have  been  "deleted"  from  the  parameter  vector.  See 
Chapter  III,  sections  D , E and  F,  for  further  information. 


A.  OVERVIEW  OF  COMPUTER  PROGRAM  HYTEST 


The  program  HYTEST  exploits  techniques  enumerated 
earlier  in  this  paper  to  determine  if  selected  hypotheses  of 

the  form  H : k 'b  * m are  testable  within  the  framework  of 

0 i 

a specified  linear  model.  If  an  hypothesis  is  not  testable, 
HYTEST  computes  an  algebraic  form  of  the  parameter  vector 
that  would  actually  be  tested  if  the  proposed  hypothesis 
were  to  be  employed. 
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Given  the  standard  design  natrix  (X)  from  the  linear 
model,  and  appropriate  additional  constraints,  HYTEST  can 

* 

compute  the  design  matrix  (X  ) for  use  in  the  8MD05V 

program.  If  desired,  DESIGN  cards  for  use  in  the  8I0HED 
package  can  be  produced  as  an  auxiliary  output.  If  the 
BIOMED  option  is  exercised,  HYTEST  performs  all  of  its 

* 

calculations  upon  X making  it  essential  for  the  user  to  be 

aware  of  the  exact  structure  not  only  of  the  original  design 

* 

matrix  X,  but  also  of  the  constrained  matrix  X . The  user 

can  select  the  option  to  have  all  calculations  performed  on 
X,  if  preferred. 


3.  INPUT  OPTIONS 


The  program  can  accept  either  the  standard  design  matrix 
from  the  basic  linear  model,  or  a BMD05V  design  matrix. 


C.  OUTPUT  OPTIONS 


1 • theses 


The  major  option  for  output  concerns  the  testability 
of  user  defined  hypotheses.  For  any  number  of  hypotheses 
from  zero  to  99,  HYTEST  will  determine  if  each  hypothesis  is 
testable,  and  if  not  testable,  the  program  computes  what 
would  actually  be  tested  if  the  specified  hypothesis  were  to 
be  employed.  This  option  is  automaticllly  suppresssed  if 
there  are  no  hypotheses  to  be  tested. 
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2 • design  cards 

If  a BIOMED  design  matrix  is  computed,  the  user  can 
select  an  option  that  will  punch  appropriate  DESIGN  cards 
for  BMD05V;  a printed  replica  of  the  cards  is  also  produced. 
If  punched  cards  are  not  required,  the  user  can  opt  for  the 
printed  form  of  the  DESIGN  cards  without  having  them 
punched.  This  output  option  is  only  available  if  a standard 
design  matrix  X is  used  for  input. 

3 . accuracy 

An  estimate  of  the  accuracy  with  which  HYTESI  is 
computing  the  generalized  inverse  matrix  G can  be  obtained 
upon  request.  The  output  consists  of  the  matrix  resulting 
from  subtracting  X'X  - X'XGX'X.  If  the  computer  were 
perfectly  accurate,  all  entries  would  be  zero.  Because  of 
arithmetic  inaccuracies  in  computing  G,  the  entries  are 
frequently  not  zero.  Since  the  matrix  H=GX'X  is  used  in 
assessing  the  testability  of  hypotheses,  X'X  -X'XGX'X 
affords  an  estimate  of  the  accuracy  to  be  expected  when 
determining  the  nature  of  the  hypothesis  being  tested. 

<*•  fl  f SllS 

The  H matrix  used  in  determining  the  testability  of 
selected  hypotheses  will  be  printed  if  requested.  Selecting 
this  option  will  generate  printed  output  for  all  of  the 
previous  options  as  well. 
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5.  generalized  inverse  G 

The  final  option  prints  out  the  generalized  inverse 
of  X • X used  in  computations  within  HYTEST.  This  option 
automatically  includes  printed  output  for  all  previous 
options  as  well. 


D.  INPUT  REQUIREMENTS 


1 . card  order 


Input  cards  must  be  in  the  following  order: 


PR03LM  card  (required) 

Design  matrix  cards  (required) 

AUXEQN  card  (required  if  standard 

design  matrix  input  option  is  used) 
Auxiliary  equations  (optional) 

Hypotheses  (optional) 

FINISH  card  (required  after  last  problem) 

2.  problem  cajd  (required) 

The  HYTEST  PROBLM  card,  based  upon  a similar  card 
used  in  the  BHD05V  program,  is  used  to  set  up  various 
program  parameters  and  options. 

DATA  COLUMNS  RESTRICTIONS 

PRC ELM  1-6 

User's  optional  problem  number  7-8 
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Number  of  design  card  sets 

9-11 

1 £ ND  i 150 

Number  of  columns  in  design  matrix 

12-13 

1 S NC  < 60 

Blank 

14-15 

Number  of  hypotheses 

16-17 

1 S NH  < 99 

Blank 

18-26 

Output  options 

27 

Blank 

2 8-71 

Input  options 

72 

0=  X matrix 

* 

1=  X matrix 

Output  options  are 

table. 

IF  COLUMN  27  CONTAINS 
0 

1 


2 


3 


selected  from  the  following 

THE  PRINTED  OUTPUT  WILL  INCLUDE 

A.  An  assessment  of  what  each 
hypothesis  is  testing 

B.  Design  cards  (if  the  3I0MED 
output  option  is  being  used) 
plus  option  0 

C.  Accuracy  of  coefficients  and 
option  0 

D.  The  H matrix  (H=GX'X); 

plus  all  output  from  options 
0 and  1 

E.  The  generalized  inverse  G 
plus  all  previous  options. 


3 . (jesjgp  matrix  cards  (required) 
a.  standard  design  matrix 

One  design  card  is  required  for  each  unique  row  of 
the  design  matrix.  The  first  2 columns  of  card  i contain 
(right  justified)  the  number  of  rows  in  the  design  matrix 
that  are  identical  to  row  i.  The  next  60  columns  (3-62)  are 
reserved  for  the  columns  of  the  design  matrix.  Enter  a zero 
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or  one  in  the  appropriate  card  column.  Each  card  column 
corresponds  to  a column  in  the  design  matrix  (not  to  exceed 
a total  of  60  columns) . 

example 

If  a hypothetical  design  matrix  were: 

110  0 
110  0 
10  10 
10  10 
10  10 
10  0 1 

Only  three  design  cards  are  required: 

021100  (in  columns  1-6) 

031010 

011C01 

b.  BMD05V  design  matrix 

Enter  the  BIOBED  design  matrix  (without  data  cards) 
exactly  as  it  is  used  in  the  BMD05V  package. 

4.  kUX EQN  card  (required  if  the  standard  design  matrix 
lS£Ut  is  ssedl 


DATA 

COLOBNS 

RESTRICTIONS 

AUXEQN 

1-6 

Number  of  auxiliary 

equations 

7-8 

0 < NA  < 99 

Punched  card  output 

option 

9-11 

1=Design  cards 

Punched 
0-Design  cards 
Hot  punched 

Caution:  Do  not  use  this  card  if  the  BIOMED  input  option  is 
exercised. 

If  there  are  no  auxiliary  equations,  the  program 
will  perform  its  computations  on  the  standard  design  matrix. 
None  of  the  EIOHED  output  options  are  then  available. 
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5 . Auxiliary  equations  (optional) 

Auxiliary  equations  are  algebraic  equivalents  of 

3MD05V  constraints.  For  the  example  to  be  used  in  this 

section  assume  the  linear  model 

Y = u + a ♦ e i= 1 , 2,3 

ij  i ij 

Each  auxiliary  equation  (m)  requires  a separate 
series  of  input  cards.  The  first  card  contains  the  number 
of  parameters  in  equation  m.  As  an  example,  the  auxiliary 
equation 

a ♦a  + a * 0 
1 2 3 

contains  three  parameters  so  the  first  card  for  this 
equation  would  contain  03  in  the  first  two  columns. 

The  second  card  contains  the  column  number  and  the 
coefficient  of  the  parameter  whose  column  is  to  be  deleted 
from  the  BMD05V  matrix.  For  instance,  the  parameter  vectotfV 
in  this  example  is 

b'  = (u,  a , a , a ) 

12  3 

Parameter  a^  corresponds  to  the  fourth  column  of  the  design 

matrix  X.  In  order  to  eliminate  a from  the  BMD05V  matrix, 

3 

the  second  card  must  contain: 

0401 

in  the  first  four  columns. 


A separate  card  is  required  for  each  parameter  in 
the  auxiliary  equation,  so  this  example  needs  two  more 
cards: 

0201 

0301 


corresponding  to  the  columns  and  coefficients  for  a and  a 

1 2 

respectively . 

Caution:  if  a parameter's  column  is  to  be  deleted 

from  the  BHD05V  design  matrix,  that  parameter  must  not  be 
used  in  an  auxiliary  equation;  its  algebraic  equivalent  must 
be  used  instead.  & complete  set  of  cards  must  be  included 
for  each  separate  auxiliary  equation  (m  cards  for  each 
auxiliary  equacion  with  a parameters) . 

The  above  input  will  produce  the  following  EMD05V 
design  cards: 

DESIGNOO 200 1 001000 
DESIGN 00 300  1C00001 
DESIGNOO  10010-10-1 

6 . Hypothesis  cards  (optional) 


The  format  for  hypothesis  cards  is  similar  to  that 
for  the  auxiliary  equation  cards.  Assume  that  the  hypothesis 
of  interest  is: 

H : a - a =0 
0 1 2 


‘2* 


There  are  two  parameters  in  the  hypothesis  (a^  and 
correspcnding  to  columns  2 and  3 in  the  constrained 


BH^05?  matrix.  It  is  important  to  note  that  the  parameters 
used  in  hypothesis  testing  must  be  associated  with  the 
matrix  used  to  compute  H = GX'X.  If  the  standard  design 
matrix  X is  used  throughout  the  program,  the  column 
corresponding  to  a specific  parameter  is  unchanged  from  the 
original  model  formulation.  If,  on  the  other  hand,  the 
standard  design  matrix  X is  used  for  input,  and  auxiliary 
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* 

eguations  are  used  to  reduce  X to  the  BHD05 V matrix  X , then 

the  columns  used  in  the  hypothesis  cards  must  be  the 

* 

appropriate  columns  from  X . The  first  card,  for  this 

example  hypothesis,  contains  02  (for  two  parameters)  in  the 
first  two  columns.  The  second  and  third  cards  contain 
021.0 
03-1.0 

respectively.  The  first  two  columns  identify  the 
parameter's  column  in  the  appropriate  design  matrix.  The 
parameter's  coefficient,  a decimal  point  and,  if 
appropriate,  a minus  sign,  must  be  punched  in  columns  3 
through  11. 

7 • FINISH  car d (Required  after  last  problem) 

If  several  problems  are  to  be  run  in  sequence,  the 
cards  for  each  problem  are  to  be  grouped  in  sequential 
blocks.  Once  HYTEST  finishes  a problem  it  determines  if 
another  problem  is  to  be  run.  If  so,  it  executes  that 
problem.  It  will  continue  executing  problems  in  sequence 
until  all  problems  are  completed  and  a FINISH  card  is 
encountered.  The  FINISH  card  must  be  the  last  card  in  the 
data  deck  and  the  word  FINISH  must  be  punched  in  the  first 
six  columns. 


' 


I 
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APPENDIX  C 


EXAMPLE 


A.  THE  MODEL 


The  use  of  HYTEST  can  best  be  illustrated  through  use  of 
an  example.  The  one  chosen  is  from  Searle  [ 1 ]. 

The  model  is: 


ijk 


m ♦ A ♦ B + AB  + e i=1,2,3 

i j ij  ijk 

j = 1/  . ../4 
k=  1 ,.  . . ,4 

Y is  the  kth  observation  of  the  ith  treatment  of  the  jth 
ijk 

type;  H is  the  mean  effect;  A is  the  effect  of  the  ith 

i 

t* 

treatment;  B is  the  effect  of  the  jth  type;  A3  is  the 

3 ij 

interaction  between  the  ith  treatment  and  the  jth  type  and 

e is  the  error  term.  The  number  of  observations  noted 
ijk 

for  each  cell  is  shown  in  the  following  table: 


j*1  j=2  j*3  j=4 

i»1  3 0 1 2 

i«2  2 2 0 0 

1*3  0224 
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The  X matrix  is 


110  0 10 
110  0 10 
110  0 10 
1 1 0 0 0 0 
1 1 0 0 0 0 
1 1 0 0 0 0 
10  10  10 
10  10  10 
10  10  0 1 
10  10  0 1 
10  0 10  1 
10  0 10  1 
10  0 10  0 
10  0 10  0 
10  0 10  0 
10  0 10  0 
10  0 10  0 
10  0 10  0 


0 0 1 0 0 0 
0 0 1 0 0 0 
0 0 1 0 0 0 
1 0 0 0 1 0 
0 1 0 0 0 1 
0 1 0 0 0 1 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 0 0 0 0 
1 0 0 0 0 0 
1 0 0 0 0 0 
0 1 0 0 0 0 
0 1 0 0 0 0 
0 1 0 0 0 0 
0 1 0 0 0 0 


0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
1 0 0 0 0 
1 0 0 0 0 
0 0 0 0 1 
0 0 0 0 1 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 


Suppressing  the  zero  columns  for  AB  , AB  , kb  , 

12  23  24 

AE^  the  input  design  matrix  for  HYTEST  is 

03 1100100010000000 
011100001001000000 
021 100000100100000 
021010100000010000 
021010010000001000 
021001010000000100 
021001001000000010 
04100  1000100000001 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
1 0 
0 1 
0 1 
0 1 
0 1 


and 
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B.  CARD  PREPARATION 


The  size  of  the  BMD05V  matrix  is  controlled  by  the  type 
of  constraints  used  in  the  BIONED  package.  In  general,  the 
number  of  parameters  for  each  main  effect  and  each 
interaction  effect  is  reduced  by  1 in  each  dimension.  In 
this  example,  1 parameter  is  deleted  for  each  of  the  main 
effects,  A and  B.  Since  i = 1/2,3  and  j = 1,2, 3, 4 there  are 
3 x 4 = 12  interaction  terms.  Reducing  each  dimension  by  1 
yields  i = 1,2  and  j = 1,2,3  for  2x3=6  remaining 
interaction  terms.  As  noted  in  the  preparation  of  the  input 
design  matrix  above,  the  columns  for  four  interaction  terms 
are  zero.  Those*  four  terms  are  therefore  deleted  from  the 
model.  Two  other  interaction  and  two  main  effect  terms  can 
be  deleted  by  addition  of  the  usual  BMD05V  constraints.  For 
this  example,  the  following  constraints  were  adopted: 


(1) 

A ♦ 

A t A =0 

1 

2 3 

(2) 

B + 

B ♦ B ♦ B 

= 0 

1 

2 3 4 

(3) 

AB 

♦ AB  ♦ AB 

♦ AB 

11 

12  13 

14 

(4) 

AB 

♦ AB  ♦ AB 

♦ A3 

31 

32  33 

34 

= 0 
= 0 


In  auxiliary  equation  (1)  it  is  desired  to  delete 
parameter  A which  corresponds  to  column  4 of  the  design 

matrix.  Since  there  are  three  parameters  in  this  equation 

the  first  card  is: 

03 

The  other  cards  are: 

0401 
0 201 
0301 

This  completes  the  cards  for  auxiliary  equation  (1) . 
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Auxiliary  equation  (2)  is  quite  similar.  If  B is  chosen 

4 

for  deletion  the  cards  are: 


0801 

0701 

0601 

0501 


Auxiliary  equation  (3)  contains  the  parameter  AB^  wkich 

does  not  appear  in  the  desiqn  matrix  (it  is  a column  of 

0's) . Under  the  BMD057  constraints,  it  has  an  algebraic 

equivalent,  AB  = -AB  - AB  , which  must  be  used  in  its 
12  22  32 

place.  Equation  (3)  then  becomes 

(3a)  AB  - AB  - A3  ♦ AB  ♦ AB  = 0. 

11  22  32  13  14 

The  input  cards  used  to  delete  AB  from  the  design  matrix 

14 


14-1 


091  1 


The  interaction  term  AB^  from  equation  (4)  , does  not 

appear  in  the  design  matrix.  Substituting  AB  * - AB 

31  11 

AB^  into  equation  (4)  yields: 

(4a)  -AB  - AB  ♦ AB  ♦ AB  ♦ AB  * 0. 

11  21  32  33  34 

The  input  cards  used  to  delete  AB  from  the  design  matrix 

34 


f 
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parameters  from  the  standard 
BMD05V  matrix  of  twelve  non-zero 


05 

1601 

09-1 

12-1 

1401 

1501 


Deleting  a total  of  eight 
design  matrix  will  yield  a 
columns. 


123456  7 8 9 

MAAEBBAB  AB  KB 

1 2 1 2 3 1 1 13  21 


10 

1 1 

12 

AB 

AB 

Afi 

22  32  33 


C.  HYPOTHESES 


TO 

that  A 

1 


verify  that 

-A  is  not, 
2 


A 

1 

the 


- A ♦ AB  - AB 
2 11  21 

following  cards  are 


is  estimable  but 
used: 


04 

021.0 

03-1.0 

071.0 
09-1.0 
02 

021 .0 

03-1.0 


After  completing  the  PBOBLM  card,  the  AUXEQN  card  and 
the  FINISH  card  as  outlined  in  the  User's  Guide  and  putting 
the  cards  in  the  correct  order,  one  is  ready  to  run  HYTEST. 


The  outputs  of  major  interest  are  the  DESIGN  cards  and 
the  hypotheses.  In  the  interest  of  brevity,  only  those  two 
outputs  will  be  illustrated. 


42 


■ 


0.  SAMPLE  00TP0T 


Design  cards  suitable  for  BMD05V  will  be  similar  to  the 
sample  shewn  below. 

DESIGN  3110100100000 
DESIGN  1110001010000 
DESIGN  2 1 1 0 -1  -1  -1  -1  -1  0 1 1 0 
DESIGN  2101100001000 
DESIGN  2101010000100 
DESIGN  21-1-1010000010 
DESIGN  21-1-1001000001 
DESIGN  4 1 -1  -1  -1  -1  -1  1 0 1 0-1  -1 


The  column  on  the  left  below  represents  the  vector  g'  in 
the  hypothesis  H:  g'b  * m.  The  column  on  the  right  below 
represents  the  vector  q'h.  If  g'  = q'H,  the  hypothesis  is 
testable.  If  q'  # g'H,  the  hypothesis  is  not  testable. 

THE  HYPOTHESIS 
O?  INTEBEST  IS 
(HO:  Q'B  = M) 

0.0 
1.0 
-1  .0 
0.0 
0.0 
0.0 
0.0 
0.0 
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THE  HYPOTHESIS  WHICH  IS 
ACTUALLY  BEING  TESTED 
(HO:  Q'HB  = M) 

0.115 

0.748 

-0.656 

-0.084 

-0.298 

0.038 

0.221 

0.099 


0.0  -0.374 

0.0  -0.160 

0.0  0.275 

0.0  -0.061 

MITHIN  THE  LIMITS  OF  COMPUTATIONAL  ACCURACY, 
HYPOTHESIS  1 CANNOT  BE  TESTED  AS  STATED  ABOVE 


THE  HYPOTHESIS 
OF  INTEREST  IS 
(HO:  Q'B  * M) 
0.0 
1.0 
-1.0 
0.0 
0.0 
0.0 
1.0 
0.0 
-1.0 
0.0 
0.0 
0.0 


THE  HYPOTHESIS  WHICH  IS 
ACTUALLY  BEING  TESTED 
(HO:  Q'HB  = M) 

0.0 
1.0 
-1.0 
0.0 
0.  0 
0.0 
1.0 
0.0 
-1.0 
0.0 
0.0 
0.0 


0.0  0.0 
HITHIN  THE  LIMITS  OF  COMPUTATIONAL  ACCURACY, 
HYPOTHESIS  2 CAN  BE  TESTED  AS  STATED  ABOVE 


As  expected  (Searle  £1]),  one  hypothesis  is  estimable; 
the  other  is  not.  Additional  output  inforaation  is 
available  froa  HYTEST  if  desired.  The  User's  Guide 
enuaerates  all  of  the  options. 
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V 

APPENOIX  D 


CCMPUTER  PROGRAM 


RE AL*8  EP,BC,0F 

DIMENSION  X ( 1 50. 6 C ) , NRE  PS ( 150)  ,XS(150,60) 

Cl  PENSION  XR l 60 ) » IC(60),XSX(60,6C)» wKARE A ( 3 £4C ) » Q ( 60  ) 

CIMENSICN  CH(60),  H(60,60l 

CATA  0P / ' PROBLM  ' / ,B F/ 'F IN ISH  •/ 

IOOC  FCPMAT  COS 'PROBLEP  NUMBER  •»  I 4 ) 

1001  FORMAT  ('O'  * ' NUMBER  CF  DESIGN  CARC  SETS'  * 16  ) 

1002  FORMAT  ('0','NUMBER  GF  INDEPENDENT  VARIABLES  IN  DESIGN 
1 MATRIX' *15) 

IOC 2 FORMAT  COS'NUMBER  OF  INDEPENDENT  VARIABLES  IN  8MC05V 
1 MATRIX'. 15) 

101C  FORMAT  ('OS'  WITHIN  THE  LIMITS  CF  COMPUTATIONAL  ACCURA 
ICY  , • ) 

1011  FCPMAT  C s 'HYPOTHESIS'. 13, 2X,  'CAN  BE  TESTED  AS  STATE 
1C  ABOVE • ) 

1012  FORMAT  (•  • . • HYPOTHES I S ' 1 1 3 , 2X , • C ANNOT  BE  TESTED  AS  ST 
IATEC  ABOVE') 

1026  FCPMAT  (• 0 S' GENERALI  ZED  INVERSE  MATRIX,  G'/) 

1027  FORMAT  I'O't'H  MATRIX  ( GX"X  ) • / ) 

102C  FCFMAT  C S25F5.1) 


1035 

1036 

1037 

1040 

1041 

1050 

1051 


1060 

1061 

1062 

1063 

1066 

107C 

1071 


CO', 'ACCURACY  CF  i 

C SiO(3X,F10.5)) 
CO') 


FORMAT  ('O', 'ACC 
1/) 

FCFMAT  (•  ',10(3 
FORMAT  CO') 

FORMAT  ( • 0 ' , ' INPUT 
FORMAT  CO'.'INPUT 
FORMAT  (A6 , 12,13 
FCPMAT  COS'FIN 
1LT  OF  SEQUENCE. 
2TING.') 

FCPMAT  COS'THE 
1 IS') 

FCPMAT  C S5X.F 
FORMAT  C ',' OF 
1C) 

FORMAT  f S' (HO 


CuEFFI C I ENTS 


;«x  - xmxgx"x ) ' 


OPTION: 

CPTION: 


STANCARC  DESI 
BMD05V  CESIGN 


- ,■  imru  i uriiuiNi  cruusv  uciUjiN 


PRGGRAM  CANNOT  CONTINUE 


GN  MATRIX' ) 
MATRIX* ) 

PRCBLM  CARC  C 
ANC  IS  TERM  INA 


HYPOTHES IS' tlOX, 'THE  HYPCTMESIS  WHICH 


('  ',5X,F7.3,iflX,F7.3  ) 

C S ' Or  INTEREST  I S S 1 OX  , • ACTUAL 


M)  SlOX,  '(HO: 

, II  . • fiKhCT  ftl 


FORMAT  f S'(HO:  Q"B  * M)  • . 10X,  • (HO: 

FORMAT  COS 'HYPOTHESIS',  I3t‘  CANNOT  BE 
1 CC-0  IS  SINGULAR') 

FORMAT  (6X,I2,13,12,2X,I2,8X,I2,44X,I1) 
FORMAT  ( I 2 , 6Cr 1. 0 ) 


LY  BEING  TESTE 

Q"He  = M ) * / / ) 
TESTED  BECAUSE 


1072  FCPMAT  (6X, I3.21F3  .0/6X,22F3.0 ) 

1075  FORMAT  (I2.30F2.0) 

1076  FORMAT  C S • OE  S IGNS  1 3 ,21 1 3 / 7X,22I3) 

1077  FORMAT  COS'DESIGN  CARDS'/) 

1078  FORMAT  C OES IGN* , 1 3 ,21 13/ • OES 1GN S 2213/ • 

1079  FORMAT  II2.F9.0) 

1C9C  FORMAT  COS'CESIGN  CARD  NUMBER S 14, 2X, • 
C READ  IN  THE  DESIGN  MATRIX  ANO  THE  PROBLEM  CA 
RE  AC  (5,1070)  NP,NO»N»NH»IPR,IC 
227C  WRITE  (6,1000)  NP 


CES IGN' ,2213) 

IS  1NVALI0') 
PC 


227C 


7C  WRITE  (6,1000)  NP 
WRITE  (6,1001)  NO 
WRITE  ( 6 £ 1002  I N 

THIS  ^REAoi  IN^THE*  BIOMEO 


2305 

0ES1GN 


MATRIX 
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PEAC  (5.1072) 
IF  (NREPS  (I) 
2300  CCMINUE 
NS*N 

GC  TO  2365 
C THIS  BEADS  IN  THE 


NREPS(I),  (XS ( I , J ) , J*1 ,N  ) 
.LE.  0)  GC  TO  2250 


STANDARD  CESIGN  MATRIX 


2305  WRITS  (6,1040) 

DC  2310  I * 1 *ND 

BE  AC  (5,1071)  NREPS  (I  ),  (X(I  . 
IF  (NREPS  (I)  .LE.  0)  GC  TO  2250 
23  1 C CONTINUE 

C CCNPLTE  THE  BMC05V  MATRIX  (X*) 

BEAO  (5,1070)  NA.IPCH 
NS*N-NA 

IF  (NA  .NE.  0)  GO  TO  2315 
CC  2313  1*1, NO 
DC  2313  J*1,NS 
XS(I,J)*X(I,J) 

CONTINUE 

GC  TO  2365 
WRITE  (6,1003) 


JR  ) , JR* 1 , N ) 


2313 


2315 


2315  WRITS  (6,1003)  NS 
C READ  IN  EACH  AUXILIARY  EQUATION  : 
C ANC  CC  THE  eASIC  ROW  OPERATIONS 
CC  2330  I S* 1 , NA 
DC  2320  I R*1 »N 
XR  ( IR)*0.0 
232C  CONTINUE 

READ  ( 5 , 1C75 ) NVAL 
NVAL*NVAL-l 

READ  (5,1075)  ID(IS)»XVAL 
XP(IC(  IS) )*XVAL 
CC  2323  I V=  l , NVAL 
READ  (5,1075)  NL.XVAL 
XR(NL)*XVAL 
2322  CONTINUE 

DC  2325  IC*1 ,ND 
I F ( X ( 1C , ID ( IS  ) ) • EQ.  0.0)  GO 


TURN 


2324 

2325 
223C 

c su 


CC  2324  JX*1 , N 
X(IC,JX)*X( IC,JX)-XB( JX) 
CONTINUE 


2325 


CCNTINUE 
CONTINUE 
BESS  ZERO 


SUPPRESS 

JR«1 


ROWS 


COLUMNS 


CC  2340  JT*1,N 
CC  2335  JX-l.NA 

IF  (ID(JX)  .£0.  JT)  GC  TO  2340 

2335  CCNTINUE 

CC  2336  IS-L.ND 
XS(IS,JR)*X(IS,JT) 

2336  CCNTINUE 
JB* JR+1 

224C  CCNTINUE 

C OUTPUT  eMOOSV  CESIGN  CAROS 

IF  (IPR  .LT.  1)  GC  TO  2365 
WRITE  (6, 1C77) 

CC  2360  J*l,NO 

fE<HU  K‘l,NS 

IF  (XS( J,K ) .LE.  -0.5 ) ID(K  )*  -1 
IF  (XS( J ,K ) .Ge.  0.5)  IO(K)>  1 
2355  CCNTINUE 

WRITE  (6,1076)  NREPS  (J  ) , ( 1 C (K  ) ,K*1 , NS ) 

IF  (IPCH  .LT.  II  GC  TO  2360 

WRITE  (7,1078)  NREPS ( J ) , ( IC(K) ,K*1,NS) 


2355 


-0.5) 

0.5) 


IO(K  )* 
10 (K ) * 


[36C 


CCNTINUE 
LIE  X*X»XS 
CC  2380  1*1, NS 
DC  2380  K-l.NS 

x|xTi.k)*o.o 

CC  2380  J* I , NO 
NF IX*NREPS( J) 


46 


_ r-  . 'r  ” ■ * ‘ ■'  - 


238C 


239C 


2392 


2393 


CC  2380  J F IX=1 • NF  IX 
X!X(I,KI=XS(  J,I)*XS(J,K)+XSX(I,K) 

8C  CCNTINUE 

CALL  LPSDOR  (XSX,NS,NS,60,  F.5 , WKAREA, I ER  ) 
TRANSPOSE  Tl-E  PSEUDO-INVERSE  TO  GET  TFE  GENERAL 
AVERSE  (IN  X) 

CC  2390  J=1,NS 
CC  2390  1=1, NS 
X(J,I)=  H ( I , J ) 

9C  CCNTINUE 

IF  C I PR  .LT.  3)  GO  TO  2395 
WRITE  (6,1026) 

NW  = 1 

92  NENC*NW+9 

IF  (NEND  .GT.  NS)  NEND=NS 
CC  2393  IW  = 1 ,NS 

WRITE  (6,1036)  ( X ( I W , JW ) , J W=NW  , NEND ) 

93  CCNTINUE 
WRITE  (6,1037) 

NW  *NW+10 

IF  (NEND  .LT.  NS)  GC  TC  2392 
COMPUTE  P = GX'X 


NEND=NS 


2395  CC  2400  1=1, NS 
DC  2400  K* 1 , NS 
KI,K)  = 0.0 
CC  2400  J=1 , NS 

H(I,K)=X( I , J) *XSX( J,K)+H( I , K ) 

2400  CCNTINUE 

IF  (IPR  .LT.  2)  GC  TO  2405 
WRITE  (6,1027) 

N W = l 

2<«G2  NENC=NW+9 

IF  (NENC  .GT.  NS)  NEND=NS 
CC  2403  I W = I , NS 

WRITE  (6,1036)  ( H ( I M , JW ) , JW=NW , N ENC ) 
2403  CCNTINUE 

WRITE  (6,1037) 

NW*N«+1C 

IF  (NENC  .LT.  NS)  GO  TO  2402 
C CFECK  F eY  COMPUTING  X'XGX'X 


2<«G2 


2403 


2405 


2410 


2415 


24ie 


2420 


C INPU 
2422 


2423 


2425 


IF  (IPR 
CC  2410 
CC  2410 
XS(I*K) 
CC  2410 
XS ( I « K) 


.LT.  1) 
I ■ l , NS 

.s:j-ns 

J»i,NS 


CL  24 10  J* 1 , NS 
XS(I,K)«XSX(I  ,J)*H(J, 
CCNTINUE 
CC  2415  1=1, NS 


2422 


K ) +XS ( I,K) 


DC  2415  J= ltNS 
XS(I,J)«XSU,J)-XSX(  I,J) 
CCNTINUE 
WRITE  ( 6 , 1 C35 ) 

NW  = 1 

NEN0>NW*9 
IF  (NEND  .GT.  NS) 

CC  2420  IWI.NS 
WRITE  ( 6 , 1C36)  (X 
CCNTINUE 
WRITE  (6«1C37) 

NV-NW+IC 

IF  (NH  .EQ.  0)  GO 
CC  2200  LCCP«1,NH 
CC  2423  1=1, NS 

cm-o.c 

CCNTINUE 

RE AO  (5,1075)  NHY 

suns, 
m ar 


[NUE 

: ( 6 , 1C35 ) 


S)  NEND-NS 

(XS(IW,JW),JW*NW,NEND) 


GO  TO  2418 
TO  EE  TESTED 
TO  9999 


) NHYP 
NHYP 

) NQ,HVAL 
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GC  TO  2450 


C CFECK  TC  ENSURE  THAT  Q'GQ  IS  NONSINGULAR 
OECK*O.OOOl 
CC  2430  K*1 f NS 
XR(K)«0.0 
CC  2430  J»1,NS 
XR(K)=Q( J)*X( J,K)+XR (K) 

2430  CONTINUE 
KLC-0.0 
DC  2440  K* 1 , NS 
KLD*XR(K)*C(K)+H0L0 
244C  CONTINUE 

IF  ( ABS ( HOLD ) .GT.  CHECK) 

WRITE  (6j  1066)  LOOP 

245C  WRITE  (6,1060) 

•FITE  (6,1062) 

WRITE  (6,1063) 

C HLLTIFLY  Q • H 

CC  2210  K3 l , NS 
>CF(K)»O.C 
CC  2210  J*1,NS 

cmkT»q(j)*h(j,k)+ch(k) 

22 1C  CONTINUE  ^ 

CC  2215  IW%,NS 
WRITE  (6,1061)  Q(IW),  QH(IW) 

22 15  CONTINUE 

WRITE  (6,1010) 

CC  2460  K*1 , NS 
CA»AeS(C(K)) 

CFA  = ABS(QMK)) 

IF  (CHA  .GT.  QA+CHECK  .CR.  CHA 
‘ GC  TO  247C 


•LT.  QA-CFcCK) 


246C 


247C 

220C 

225C 

9999 


900C 


CONTINUE 
WRITE  (6,1011)  LCCP 
GC  TO  2200 
WRITE  (6,1012)  LCCP 
CONTINUE 
GC  TC  9599 
WRITE  (6,1090)  I 

R 5 AC  ( 5,1050)  BC , NP , NO , N ,NH  , I PR  , I C 
IF  (BC  .EQ.  BP)  GO  TO  2270 
IF  (BC  .EQ.  BF)  GC  TO  9000 
WRITE  ( 6 , 1C51 ) 

S1CP 

ENC 
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